An Improved System for Exon Recognition and Gene Modeling in Human DNA Sequence

نویسندگان

  • Yin Xu
  • J. Ralph Einstein
  • Richard J. Mural
  • Manesh J. Shah
  • Edward C. Uberbacher
چکیده

A new version of the GRAIL system (Uberbacher and Mural, 1991; Mural et al., 1992; Uberbacher et al., 1993), called GRAIL II, has recently been developed (Xu et al., 1994). GRAIL II is a hybrid AI system that supports a number of DNA sequence analysis tools including protein-coding region recognition, PolyA site and transcription promoter recognition, gene model construction, translation to protein, and DNA/protein database searching capabilities. This paper presents the core of GRAIL II, the coding exon recognition and gene model construction algorithms. The exon recognition algorithm recognizes coding exons by combining coding feature analysis and edge signal (acceptor/donor/translation-start sites) detection. Unlike the original GRAIL system (Uberbacher and Mural, 1991; Mural et al., 1992), this algorithm uses variable-length windows tailored to each potential exon candidate, making its performance almost exon length-independent. In this algorithm, the recognition process is divided into four steps. Initially a large number of possible coding exon candidates are generated. Then a rule-based prescreening algorithm eliminates the majority of the improbable candidates. As the kernel of the recognition algorithm, three neural networks are trained to evaluate the remaining candidates. The outputs of the neural networks are then divided into clusters of candidates, corresponding to presumed exons. The algorithm makes its final prediction by picking the best canadidate from each cluster. The gene construction algorithm (Xu, Mural and Uberbacher, 1994) uses a dynamic programming approach to build gene models by using as input the clusters predicted by the exon recognition algorithm. Extensive testing has been done on these two algorithms.(ABSTRACT TRUNCATED AT 250 WORDS)

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Novel Mutations in IL-2 Gene in Khorasan Native Fowls

The intron-exon structure of Khorasan native fowl interleukin-2 (IL-2) was investigated. For this purpose, twenty chickens were selected from the Native Fowl Breeding Station of Khorasan province, and genomic DNA was extracted using a modified conventional DNA extraction protocol. An 875 bp fragment of IL-2 was successfully amplified, including a small part of the promoter, exon 1, intron 1, an...

متن کامل

O-36: Evaluation of Genetic Variations in Intron 4 and Exon 5 of RABL2B Gene in Infertile Men with Oligoasthenoteratospermia and Immotile Short Tail Sperm Defects

Background One of the main causes of male infertility is defect in structure and function of sperm cells. Infertile men with oligoasthenoteratospermia (OAT) defect, have sperms with abnormalities in count, motility and morphology. Patients with immotile short tail sperm (ISTS) disorder have immotile short-tailed sperm with disorganized axonem, and a significant decrease in sperm counts. Numerou...

متن کامل

Sequencing and Bioinformatics Analysis of Kappa-Casein Exon 4 Gene in Iranian Bacterianus and Dromedaries Camels

Kappa-casein, as a major protein component in mammalian milk, plays an essential role in formation and stabilization milk micelles and preventing them from aggregating and therefore, helping to keep calcium phosphate in solution and transfer of calcium and phosphors from animal milk to consumers. Therefore, the objective of the current study was to investigate genetic and phylogenetic analysis ...

متن کامل

Isolation and Characterization of a New Peroxisome Deficient CHO Mutant Cell Belonging to Complementation Group 12

We searched for novel Chinese hamster ovary (CHO) cell mutants defective in peroxisome biogenesis by an improved method using peroxisome targeting sequence (PTS) of Pex3p (amino acid residues 1–40)-fused enhanced green fluorescent protein (EGFP). From mutagenized TKaEG3(1–40) cells, the wild-type CHO-K1 stably expressing rat Pex2p and of rat Pex3p(1–40)-EGFP, numerous cell colonies resistant to...

متن کامل

C26232T Mutation in Nsun7 Gene and Reduce Sperm Motility in Asthenoteratospermic Men

Reduced sperm quantity and motility are primary causes of infertility in men. Before researchers showed that, Nsun7 gene has roles in sperm motility of mouse, that creation defect in this gene is cause infertility. This gene in human located in chromosome 4, with 12 exons and a hot spot exon (exon7). Our aim is study of the mutations of the exon7 in the normospermic and asthenoteratospermic men...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings. International Conference on Intelligent Systems for Molecular Biology

دوره 2  شماره 

صفحات  -

تاریخ انتشار 1994